ISSN (Print) 0023-4001 
ISSN (Online) 1738-0006 

Korean J Parasitol Vol. 51, No. 5: 525-530, October 2013 
m ORIGINAL ARTICLE http://dx.doi.org/10.3347/kjp.2013.5L5.525 

Comparison of Functional Gene Annotation of Toxascaris 
leonina and Toxocara canis using CLC Genomics 

Workbench 



Ki Uk Kim u , Sang Kyun Park 2At , Shin Ae Kang 23 , Mi Kyung Park 23 , Min Kyoung Cho 2 , Ho-jin Jung 4 , 

Kyung-Yun Kim 4 and Hak Sun Yu 2,3 * 

' 'Department of Internal Medicine, and department of Parasitology, School of Medicine, Pusan National University, Yangsan 626-870, Korea; 
3 lmmunoregulatory Therapeutics Group in Brain Busan 21 Project, Yangsan 626-870, Korea; 4 lnsilicogen, Inc., Venture Valley, Suwon 441-813, Korea 

Abstract: The ascarids, Toxocara canis and Toxascaris leonina, are probably the most common gastrointestinal helminths 
encountered in dogs. In order to understand biological differences of 2 ascarids, we analyzed gene expression profiles of 
female adults of T. canis and T. leonina using CLC Genomics Workbench, and the results were compared with those of 
free-living nematode Caenorhabditis elegans. A total of 2,880 and 7,949 ESTs were collected from T. leonina and T. canis, 
respectively. The length of ESTs ranged from 1 06 to 4,637 bp with an average insert size of 820 bp. Overall, our results 
showed that most functional gene annotations of 2 ascarids were quite similar to each other in 3 major categories, i.e., 
cellular component, biological process, and molecular function. Although some different transcript expression categories 
were found, the distance was short and it was not enough to explain their different lifestyles. However, we found distin- 
guished transcript differences between ascarid parasites and free-living nematodes. Understanding evolutionary genetic 
changes might be helpful for studies of the lifestyle and evolution of parasites. 
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INTRODUCTION 

The ascarids, Toxocara canis and Toxascaris leonina, are proba- 
bly the most common gastrointestinal helminths encountered 
in dogs. These ascarids have similar aspects; their genera are 
included in the Family Ascarididae and their morphological 
features are difficult to distinguish from one another by naked 
eyes of a non-professional person [1]. However, they also ex- 
hibit some different biological features. Prenatal transmission 
assures that virtually all puppies are bom infected with T, canis 
[1,2]. This transplacental transmission is observed in the life 
cycle of other Toxocara sp.; it is probably the major route of in- 
fection for kittens with T cati. T. canis can cause a serious dis- 
ease in puppies; however, T leonina is generally less pathogenic 
because they have no prenatal transmission ability. In a young 
host, T canis can migrate to several internal organs (intestine- 
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lung-trachea-intestine) during their juvenile stage for matura- 
tion into adult worms; however, I leonina larvae can mature to 
be adults in the intestinal cavity without tissue migration [1]. 
Finally, T canis can grow into an adult only in canid hosts; 
however, T. leonina can grow to be an adult in both canid and 
felid hosts [1], 

In order to characterize the biology, pathogenesis, and dif- 
ferential diagnosis of the 2 genera, many genetic variation stud- 
ies have been reported [3-6]. Most studies dealt with genetic 
and serologic characteristics for the diagnosis of infection with 
2 parasites. The parasites have similar antigenic characteristics, 
and T amis-infected host serum can be reacted with T leonina 
antigens [3]. However, sequences of cox2 in mitochondrial 
DNA and FTS-2 region of ribosomal DNA of Toxocara and Tox- 
ascaris differed significantly from one another [4,7]. However, 
no study comparing functional gene annotation profiles be- 
tween I canis and T. leonina has been reported. 

Functional genomic suategies, such as the expressed sequence 
tag (EST), are a powerful tool for identification of large num- 
bers of genes [8]. EST analysis is not only one of the more ef- 
fective methods for gene discoveiy, gene expression profiling, 
and functional genome studies, but also one of the more effi- 
cient methods for identification of differential genes [8]. EST 
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analyses of T. canis and I leonina have been reported, respec- 
tively [8- 10] . An efficient functional annotation of DNA or pro- 
tein sequences is a major requirement for successful applica- 
tion of these approaches as functional information on gene 
products is often the key for interpretation of experimental re- 
sults [11]. Recently, a new functional annotation program, Blast- 
2GO according to Gene Ontology (GO) criteria has been in- 
troduced; it is a powerful tool for comparison of functional 
gene expression profiles between 2 genetically similar organ- 
isms [12-14], The GO project provides ontology of defined 
terms representing properties of gene products. The ontology 
covers 3 domains; cellular component (parts of a cell or its ex- 
Uacellular environment), molecular function (elemental activ- 
ities of a gene product at the molecular level, such as binding 
or catalysis), and biological process (operations or sets of mo- 
lecular events with a defined beginning and end, pertinent to 
functioning of integrated living units: cells, tissues, organs, and 
organisms) (http://www.geneontology.org). 

The aim of this study was to understand parasite-specific func- 
tional gene expression profiles of ascarids, and difference of 
gene expression profiles between the 2 ascarids. We analyzed 
gene expression profiles of female adults of T canis and I leo- 
nina using CLC Genomics Workbench and the results were 
compared with those of a free-living nematode species, Cae- 
norhabditis elegans. Conducts of functional genomic studies 
give to us understanding of parasitism, development, metabo- 
lism, and reproduction of these parasites. 

MATERIALS AND METHODS 

Parasite 

Adult T leonina and I canis female worms were collected 
from naturally infected dogs in the South Korea ( = Korea). 
The worms were classified in accordance with the morphologi- 
cal features of the eggs and adults [1], 

Construction of cDNA libraries 

For construction of cDNA libraries, the total RNAs of adult X 
leonina and I canis female worms were purified according to 
the protocol of QIAzol (Qiagen, Valencia, California, USA). Each 
of 5 pg of poly A + mRNA was prepared from the total RNA us- 
ing the absolute mRNA purification kit (Stratagene, La Jolla, 
California, USA). The cDNA libraries were constructed using 
the ZAP Express cDNA synthesis kit according to the manufac- 
turer's instructions (Stratagene, San Diego, California, USA). 



cDNA sequencing and analysis of ESTs 

The sequence files obtained from cDNA of T leonina and T 
canis were initially submitted for pre-processing (Insilicogen, 
Suwon, Korea). This process was trimmed with sequences of 
quality scores 0.05 and ambiguous nucleotides. Each vector 
sequence was identified and trimmed off using the UniVec. 
EST sequences shorter than 100 bp were discarded. EST se- 
quences were assembled and clustered using CLC Genomics 
Workbench. Finally, clusters and singletons were analyzed for 
annotation using a homology search engine, local BLAST 
[15,16] against the NCBI database (National Library of Medi- 
cine, Bethesda, Maryland, USA). Significant matches were de- 
termined when expectation value was less than l e 4 with previ- 
ously reported genes in the NCBI database. The function of 
ESTs was predicted through Clusters of Orthologous Groups 
of proteins (COGs) analysis [17]. Briefly, Uanscription profiles 
were created for the I canis and T. leonina library using tran- 
scripts that matched sequences characterized in 2 other organ- 
isms; C. elegans 23,906 protein sequence and Ascaris lumbricoi- 
des 106,573 nucleotide. These were subjected to further BLAST 
analysis in order to obtain this information and were manual- 
ly scrutinized for determination of the most meaningful anno- 
tation for each EST within the gene ontology (GO) scheme 
(Supplementary Table 1). Results of GO analysis of 2 organ- 
isms were compared to each other, and to those of C. elegans 
in AmiGO (http://amigo.geneontology.org). 

RESULTS 

EST analysis 

A summary of library data is shown in Table 1. From the 
randomly selected plasmids containing DNA inserts, a total of 
2,880 and 7,949 ESTs of T leonina and I canis adult female 
worm cDNA library were sequenced, respectively. The length 
of ESTs ranged from 106 to 4,637 bp with an average insert 
size of approximately 820 bp. 



Table 1 . Statistics of EST of Toxascahs leonina and Toxocara canis 





T. leonina 


T. canis 


Total clones sequenced 


2,880 


7,949 


ESTs submitted for BLAST search 


585 


2,070 


Contigs 


170 


808 


Singletons 


415 


1,262 


No. of EST clusters 


42 


117 



BLAST, basic local alignment and search tool. 
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Gene ontology analysis 

Using the CLC Genomics Workbench (CLCbio, Seoul, Ko- 
rea), UniScripts were compared to the annotations in the Gene 
Ontology Consortium of C. elegans and common worm. GO 
terms were obtained for 1,178 of these UniScripts. Function 
annotation analysis was also used for determination of EC 
number and for identification of COGs and GO-annotated 
proteins. For comparison of biological and functional differ- 
ences, we differentiated ESTs as cellular component, biological 
processing, and molecular function, and Level 2 GO terms for 
enriched UniScripts comprised of 5 or more ESTs. As shown in 
the results, types of proteins expressed in T leonina and I canis 
female adult worms did not differ significantly from each oth- 
er based on GO categories. 

Cellular component level 2 GO terms analyses of 3 
organisms 

We classified the transcripts according to 16 cellular compo- 
nent categories (cell, cell junction, cell part, extracellular ma- 
trix, extracellular matrix part, exrtacellular region, extracellular 
region part, macromolecular complex, membrane, membrane 



part, membrane-enclosed lumen, nucleoid, organelle, organ- 
elle part, synapse, and synapse part). In the cellular component 
category, we found most expressed genes localized in cells (20% 
and 21%, respectively), cell part (20% and 21%), organelle 
(17% and 16%), and organelle part (11% and 11%) of T leo- 
nina and T. canis (Fig. 1). Some difference was observed within 
the extracellular region subcategoiy, which comprised of 5.9% 
of predicted I canis ESTs compared with 3.7% of predicted T 
leonina ESTs. Extracellular regions of subcategories were consis- 
tent with extracellular space and extracellular matrix proteins, 
collagen, and collagen type V protein. However, in level 2 GO 
analyses of C. elegans, the genes showing major expression 
were categorized as membrane and membrane part (meaning 
of 'part' is that the classified genes are related to the category, 
or their ancestor genes are related to the category). The propor- 
tion of cell, cell part, organelle, and organelle part categories of 
C. elegans was lower than those of parasites. 

Biological process level 2 GO terms analyses of 3 
organisms 

The biological process-related transcripts were classified ac- 
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Fig. 1 . Comparative analysis of Cellular Component Level 2 GO terms associated with C. elegans, T. canis, and I leonina enriched Uni- 
Scripts. Cellular Component Level 2 GO terms were derived for stage-enriched UniScripts using CLC Genomics Workbench. Marked 
differences were observed in the types of proteins expressed in C. elegans, T. canis, and T. leonina, most clearly in the cell and organelle 
subcategories. 
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cording to 23 categories (biological adhesion, biological regu- 
lation, cell killing, cellular component organization or biogen- 
esis, cellular process, developmental process, establishment of 
localization, growth, immune system process, localization, lo- 
comotion, metabolic process, multi-organism process, multi- 
cellular organisms process, negative regulation of biological 
process, positive regulation of biological process, regulation of 
biological process, reproduction, reproductive process, response 
to stimulus, rhythmic process, signaling, and single organism 
process). Most biological process level 2 GO terms associated 
with I leonina and T. canis of ESTs were also similar to each 
other. The most highly expressed biological process gene sub- 
categories of 2 parasites were cellular process, developmental 
process, metabolic process, and multicellular organismal pro- 
cess (Fig. 2). However, transcripts of free-living nematodes were 
the most classified as single organism process category, but 
those of parasites were not classified. 

Molecular function level 2 GO terms analyses of 3 
organisms 

Biological process-related transcripts were classified accord- 
ing to 16 categories (antioxidant activity, binding, catalytic ac- 
tivity, cannel regulator activity, electron carrier activity, metallo- 



chaperone activity, molecular transducer activity, morphogen 
activity, nutrient reservoir activity, protein tag, receptor activity, 
receptor regulator, structure molecule activity, translation regu- 
lator activity, and transporter activity). The most highly ex- 
pressed were binding (45% and 48%, respectively), catalytic 
activity proteins (28% and 26%), and structural molecule ac- 
tivity (13% and 11%) of T. leonina and T canis (Fig. 3). How- 
ever, in C. elegans, the most highly expressed was the electron 
carrier activity category (35%), on the other side, those of the 
parasites were consistent with a low proportion (2 and 3%). 
Morphogen activity, nutrient reservoir activity, and receptor ac- 
tivity of C. elegans were higher than those of the 2 parasites. 

DISCUSSION 

Functional annotation of sequence data is a key requirement 
for successful generation of functional genomics in biological 
researches. In this study, we evaluated functional annotation 
of ESTs of T. leonina, T. canis, and C. elegans using the CLC Ge- 
nomics Workbench program, in order to find some genetic 
difference for interpretation of biological differences between 
2 ascarids and a free-living nematode. 

Instead of our expectation, overall, our results showed that 
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Fig. 2. Comparative analysis of Biological Process level 2 GO terms associated with C. elegans, T. canis, and T. leonina enriched UniS- 
cripts. Biological Process Level 2 GO terms were derived for stage-enriched UniScripts using CLC Genomics Workbench. Differences in 
the types of proteins expressed in C. elegans, T. canis, and T. leonina were apparent in several subcategories, including cellular, meta- 
bolic, and developmental processes. 
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Fig. 3. Comparative analysis of Molecular Function Level 2 GO terms associated with C. elegans, T. canis, and T. leonina enriched UniS- 
cripts. Molecular Function Level 2 GO terms were derived for stage-enriched UniScripts using CLC Genomics Workbench. A major di- 
chotomy was evident in the binding and catalytic activity subcategories. 



most functional gene annotations of 2 ascarids were quite sim- 
ilar to one another. We were not surprised by this result because 
they are included in the same Family Ascarididae. In particular, 
comparative analysis of biological process level 2 GO showed 
similar profiles between the 2 ascarids. Most ascarids, includ- 
ing T canis, require internal organ migration periods for growth 
to the adult. However, T. leonina can grow to the adult without 
any migration period [ 1 ] . T. leonina can grow to be adults with- 
out any larval migration; this fact suggests several hypotheses. 
One is that molting and development of I leonina can occur 
easily, compared with T canis; in other words, I leonina might 
have more abundant proteins related to molting and develop- 
ment. The other is that T. leonina does not require certain mol- 
ecules that may be released from host tissues (lung, airway tract, 
blood, etc.) and may be critical for molting and development 
of other ascarids. In the molecular function category, structural 
molecule activity proteins showed greater expression in I leo- 
nina than in T canis. Structural molecule activity category, in- 
cluding motor proteins (actin, myosins, and dyneins), are im- 
portant to molting and development as well as to movement 
[18]. Ubiquitin, heat-shock protein 70, a-actin, and (3-actin are 
overexpressed in the abdominal muscle of marine shrimp Lito- 



penaeus vannamei during the molting cycle [18]. This result 
suggested that perhaps I leonina have more moving activity 
and ability to molt easily compared with T. canis. We cautious- 
ly suggested that these abilities articulated that molting and 
development of T leonina could occur without internal organ 
migration during their life cycle. However, our analyzed data 
were obtained from adult female worms, and the data were 
not sufficiently complete to explain the hypothesis. 

However, we found some difference between free-living 
nematodes and parasites. Membrane and membrane part cat- 
egories were composited with inttinsic to, extrinsic to, and in- 
tegral to membrane proteins. These proteins were more en- 
riched in free-living nematodes than in parasites (Fig. 1). This 
result suggested that membrane complex might have been un- 
complicated during evolutionary periods of parasites. In mo- 
lecular function categories, we found a significantly different 
proportion of subcategories between parasites and free-living 
nematodes. In particular, the election carrier activity category 
showed higher expression in C. elegans than in the other 2 par- 
asites. This category included hydrolase activity, Uansferase ac- 
tivity, and oxidoreductase activity related proteins, which were 
closely related to the metabolic system. These results suggested 
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that free-living nematodes have a more complex metabolic 
system than parasitic nematodes. Most digestive, metabolite 
systems of parasites might have degenerated, compared with 
those of their ancestors during evolutionary periods [1]. The 
most highly expressed transcripts of C. elegans were classified 
as single-organism process category in Biological Process level 
2 GO terms, but a few transcripts of parasites were classified in 
this category. However, some subcategories in the single organ- 
ism process category were also classified in the cellular process 
category. In order to identify the transcripts in this category, we 
have to classify the transcripts according to the subcategory lev- 
el in the single-organism process and cellular process category. 

In this study, although no significant difference was observed 
between I leonina and I amis female adults to explain differ- 
ences in their life cycle, we observed difference of transcript ex- 
pression between parasites and free-living nematodes. Thus, it 
might be helpful to understand evolutionary genetic changes 
of parasites. Analysis of detailed functional individual gene 
study is needed in order to understand parasite-specific tran- 
script expression patterns. 
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